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1 COCGTMTCT 

51 umiiaui 

101 CMXMMCB 

151 QCCACCAOT 

201 ATCCTCnK 

251 CTTQCATCA 

301 CGGGQQGPC 

3S1 CPGM^TMX 

401 WiMtXCQ 

461 (XKWXC^ 

501 oGcmcccc 

551 GQOOCOCAOC 

501 laCWCCTA 

651 cncccwfA 

701 ccwccoxc 

751 QCTTCCWTC 

eOl TKTCCCTOC 

851 TAMHACAA 

901 TAOXrrAQG 

%l CTTMCATTC 

1001 nMTICATA 

1061 rOQCCCGCCT 

1101 TCACCTATGI 

nSl TQQCIlXMa 

1201 TLHTATOXA 

1251 CCTQCSCAnA 

1301 TAaTCTACC 

1351 CTACATCAAT 

1«1 TCCACCCCAT 

1451 wcmccAA 

1501 MFTCCrax 

1551 QCAKTCCTT 

IGOl caQGCAoa 

1651 AGCTTQACA 



GATCAMMC 
OGATNXAA 
CAAUACTCT 
CAGTGKTQC 
AGMXATAGT 
GTOXACAC 
TACAXGTGA 
GftCAXTATC 
GCTTCCACX3G 

AcaawCT 

QAKSGAAAA 
ACACTTTACA 
MXOQCAOS 
AAAQGGNXA 
OKCCCOC 
nGTGTCTTC 
CAACQCAAOG 

ccTTrnxoc 

ATTATTGACT 
GCOCATATAT 
GQCTGKjOX 
nXCATACTA 
ATTTACXTA 
ACTACQCCCC 
TCCCCACTAC 
TATTAGTCAT 
OQKGTOWT 
TOCGTCAAT 
WTGTCGTAA 

TTTOCaGTA 

ancGCTAA 

CATCCCTCGA 

FiG. 



AACAAAAAAA 
TACCA^TCT 
ATACTCTCa 
CTAOXCQC 
IQCCKTOOC 
TACCQGATAA 
CCCAOCnOG 
OCTATOGAA 
COGTAACCCC 
QOWACGCa 
TGAGCGTCGA 
ACQCyOCAA 
GCOaOCCTC 
CCAGTAAAAG 
GACTCTAAAT 
CACCATOCT 

OTGACCGAC 
TQCTTCCCGA 
AfiTTAnAAT 

QCAcncoz: 

CCAACGACa 
ACOOCAATAG 
AACTCCOX 
aATTCACGT 
ATCACCHAT 
COaATTtfC 

QOXTTTCT 
CAAaCCQCC 
GACTOOCGU 

cnoocTcra 

GAGMACC 
CaCOGGGA 

. 1 



CCACCCCTAC 
TTTTCCCAW5 
laAGTGTAC 
aACATACa 
GATAACTCCT 
QQQ(3CAQCGG 
M3JWCQK 
AQCQCCACGC 
CACJQCTCOGA 
OGTATCmA 
mnCTCAT 
CQCCGAAHa 
ATTTCATATC 
CATTACCCCT 
CTGCCCTCAT 
nCAAAAGTC 
/SCT4GTGCCC 
AAnCCATCA 
TCTACGOXC 
ACTAATCAAT 

cn«aTAN: 

CXQCCCATTG 
GGHnTTOA 
F7Q0CACTAC 

CWTCACGGT 

0(»crrTcc 

ATIGGTGATOC 
CTCACCGGCA 

moocACo 

CCATTGACGC 
CCCTCAGATC 
CTOGTTAGAC 
CAQGCnAA 
TCTTCCATAC 



CAOCGGTOT 
GACTOQCTT 
OCCTACTTAG 

coaaocTA 

GTCTTACCQG 
TCQQQCTGAA 
OACACCWA 

rrcccwoG 

ACMSAGKX 
TACTCCTCTC 
OCTCCTCWG 
CCOCOCTCTT 
ATOCOXCCG 
QGTGOQGTTC 
CCACncCAA 
CGAA«>A7C 
GAGTAAAATT 

AGAAiaoa 

AGATATACa 
TACQOGGTCA 
n/COGTAAA 
ACCTCAATAA 
rTGACGTCAA 
ATCAACICTA 
AAATOOCCCC 
TACnOQCAC 

ocmroocA ■ 

mCCAAGTC 

aaat(av:gg 
aaat(x3qcgg 

aCCATATAA 
CACATQCAC 
CCnCAATAA 
aACCACTTC 
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1701 TGCQCaOCA GGTCGCGGCC GCGACTCTAG AGGATCTTTG TGWGGAACC 

1751 TTACrraGT GGTGTGACAT AAHGGACAA ACTACCTAa GAGATTTAAA 

1801 OaaAAGGT AAATATAAAA TTTTTAAGTG TATAATGTGT TAAAQACTG 

1851 AnaAATTG TrTGTGTAlT TTAiGAnCCA ACaATQGAA CTGATGAATG 

1901 GGAGCAGTGG TQGAATGCCT TTAATGAGGA AAACaGTTT TGCTCAGAAG 

1951 AAATGCCATC TAGTGATGAT GAGGCTAaC aGAQCTCA ACAHaACT 

2001 CaCCAAAAA AGAAGAGAAA GGTAGAAGAC CCCAAGGACT TTCOTCAGA 

2051 ATTOaAAGT TTTTTGAGTC ATGCTGTGTT TAGTAATAGA ACTGTGCn 

2101 GCTTTGCTAT HACACCACA AAGGAAAAAG CTGCAQGCT ATACAAGAAA 

2151 AHAIGGAAA AATATTCTGT AACOTTATA AGTAGGCATA ACAGTTATAA 

2201 TCATAACATA CTGTTTnTC TTACTCCACA CAGGCATAGA GTGTCTGaA 

2251 TTAATAAaA TCaCAAAAA TTGTGTACCT TTAGCmTT AATTTGTAAA 

2301 GGGGHAATA AGGAATATTT GATGTATAGT GCCTTGACTA GAGATCATAA 

2351 TCAGCCATAC CACATTTGTA GAGGTTTTAC nGCTTTAAA AAACCTCCCA 

2401 CACaCCCCC TGAACCTGAA ACATAAAATG AATGCAATTG ncnGITAA 

2451 CTTGnTAn GCAGCTTATA ATGGTTACAA ATAAAGCAAT AQCATCACAA 

2501 A7TTCACAAA TAAAGCATTT TriTCAaGC ATTCTAGTTG TOGTHGTCC 

2551 AAAQCATQ ATGTATCTTA TCATGTCTGG ATCaCTGGA ATGTGTGTCA 

2601 GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA AGTATGCAAA 

2651 GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGQCC 

2701 CCAGCAGGa GAAGTATGCA AAGCATGCAT CTCAAHAGT CAGCAACCAT 

2751 AGTCCCGCCC QAAaCCGC CCATCCCGCC CaAAQCCG CCCAfinCCG 

2801 CCCAnaCC GCCCCATQGC TGAaAATTT TTnTATTTA TGCAGAGGCC 

2851 GAGGCCGCa CGGCaCTGA GQAHCCAG AAGTAGTGAG GAGQCTTTTT 

2901 TQGAGGCCTA QGCnTTGCA AAAAGCTaat TC 

FIG. 1 ('^«''*> 
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Description 

Baclcground 

5 A basic tool in the field of recombinant genetics is the conversion of poly( A)+ mRNA to double-stranded (ds) cDNA, 

which then can be inserted into a cloning vector and expressed in an appropriate host cell. Molecular cloning nnethods 
for ds cDNA have been reviewed, for example, by Willlmas. "The Preparation and Screening of a cDNA Clone Bank, 
■ in Williamson, ed., Genetic Engineering, Vol. 1, p. 2, Academic Press, New York (1981); f\/lanlatis, 'Recombinant 
DNA", in Prescott, ed., Ceil Bloiogy, Academic Press, New York, (1980); and Efstratiadis et al., "Cloning of Double- 

10 Stranded DNA," in Steio et al., Genetic Engineering, Voi. 1, p. 15, Plenum Press, New York (1979). 

A substantial number of variables affect the successful cloning of a particular gene and cDNA cloning strategy 
thus must be chosen with care. A method common to many cDNA cloning strategies involves the construction of a 
"cDNA library" which is a collection of cDNA clones derived from the total poly(A)+ mRNA from a cell of the organism 
of interest. 

IS A mammalian cell may contain up to 30,000 different mRNA sequences, and the number of clones required to 

obtain low-abundance mRNAs, for example, may be much greater. Methods of constructing genomic eukacyotic DNA 
libararies in different expression vectors, including bacteriophage A, cosmids, and viral vectors, are knwon. Some 
commonly used methods are described, for example, in Maniatis et al., Molecular Cloning: A laboratory Manual, Cold 
Spring Harbor Laboratory, publisher. Cold Spring Harbor, New York (1982). 

20 Once a genomic cDNA library has been constructed, it is necessary to isolate from the thousands of host cells the 

cell containing the particular human gene of interest. Many different methods of isolating target genes from cDNA 
libraries have been utilized, with varying success. These include, for example, the use of nucleic acid probes, which 
are labeled mRNA fragments having nucleic acid sequences complementary to the DNA sequence of the target gene. 
When this method is applied to cDNA clones of abundant mRNAs in transformed bacterial hosts, colonies hybridizing 

2S strongly to the probe are likely to contain the target DNA sequence. The identity of the clone then may be proven, for 
example, by in situ hybridizatbn/selectbn (Goldberg etal.. Methods Enzymol., 68:206 (1979)) hybridarrested transla- 
tion (Paterson et al., Proceedings of the National Academy of Sciences. 74:4370 (1977)), or direct DNA sequencing 
(Maxam and Gilbert, Proceddings of the Natbnal Academy of Sciences, 74:560 (1 977)); Maat and Smith, Nucleic Acids 
Res. 5:4537(1978)). 

30 Such methods, however, have major drawbacks when the object is to clone mRNAs of relatively low abundance 

from cDNA libraries. For example, using direct in situ colony hybridization, it is very difficult to detect clones containing 
cDNA complementary to mRNA species present in the initial library population at less than one part in 200. As a result, 
various methods for enriching mRNA in the total population (e.g. size fractionation, use of synthetic oligodeoxynucle- 
otides, differential hybridization, or immunopurification) have been developed and are often used when low abundance 

3S mRNAs are cloned. Such methods are described, for example, in Maniatis et al.. Molecular Cloning: A Laboratory 
Manual, supra. 

Many functional eukaryottc proteins Initially exist in the from of precursor molecules which contain leader or signal 
sequences at their N-termlnal ends. These leader sequences bind to the cell membrane and draw the remainder of 
the protein through the lipid bilayer, after which the signal sequence is cleaved from the protein by a signal peptidase 

^0 enzyme. The protein thus functions only after secretion from the cells (for.example, insulin, serum albumin, antibodies, 
and digestive tract enzymes), or after the proteins have been anchored to the outer surface of a cell membrane (for 
examples, histocompatibility antigens). 

The cell surface antigens characteristic of mammalian T lymphocytes are additional examples of proteins that 
anchor to the cell surface. In mammals, certain cells derived from bone marrow mature into lymphocytes, which are 

45 present in the lymphoid organs, including the thymus, spleen, lymph nodes, and lymphoid aggregates, and also circu- 
late actively through the blood and lymph systems. Mature lymphocyte cells may be divided into two populations: 
thymus-dependent (T) lymphocytes and thymus-independent (B) lymphocytes. T lymphocytes migrate to the interior 
of the thymus, where they undergo differentiative proliferation. During their differentiation process, they express char- 
acteristic cell surface membrane alloantigens, including Thy-1 , TLA, gv-1 , Ly-1 , Ly-2, Ly-3, and Ly-5. As they mature, 

50 T lymphocytes lose the TLA antigens and some of the Thy-1 antigens, and gain histocompatibility antigens, acquiring 
the membrane confirmation typical of the recirculating T lymphoctyes. This is described, for example, by Mota, "Activity 
of Immune Cells," in Bier et aL, eds.. Fundamental of Immunology, 2d Ed, Springer- Verlag, Berlin, pp. 35-62 (1986). 

T lymphoctyes are involved indirectly in the formation of antibodies and their activities thus have required complex 
analysis of cell function, rather than simple antibody titer measurement. Partly due to this, their importance In devel- 

ss opment of immunologic competence was not recognized until relatively recently Mature T lymphocytes synthesize and 
express an unk^ue pattern of surface glycoprotein antigens which serve as markers for identification of different T 
lymphocyte subpopulations. including T helper cells, T suppressor celts, and T cytotoxk; celts. Each of these subpop- 
ulations plays a very important role in regulating the immune system. (Mota, supra). 
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In humans, the functional and phenotypic heterogeneity of T lymphocytes is well accepted. Two major subpopu- 
lations are known; effector T celts mediating cellular immunity; and regulator T cells containing helper and suppressor 
T lymphocytes. These two subpopulations have been defined with heteroantisera. autoantibodies, and monocbnal 
antibodies directed at cell surface antigens. For example, earlier in their development, human lymphoid cells in the 

s thymus express an antigen designated T11 which reacts strongly to a monoclonal antibody designated Cluster of 
Differentiation 2 (CD2). and react slightly with monoclonal antibody CDS to cell surface antigen T1. During maturation, 
these cells lose Til (CD2) and acquire three new antigens defined by monoclonal antibodies CD4, CDS. and CD1. 
With further maturation, the thymocytes cease to express cell surface antigens reactive with monoclonal antibody CD1 , 
express the T3 antigen reactive with rTKsnocbnal antibody CDS, and then segregate into two subpopulations which 

10 express either T4 (CD4) or T8 (CDS) antigen. Immunologic competence is acquired at this stage, but is not completely 
developed until thymic lymphocytes migrate outside the thymus. (Mota, supra.) In contrast with the majority of thymo- 
cytes, circulating T lymphocytes express the T1 (CDS) and T3 (CD3) antigens. The T4 (CD4) antigen is present on 
approximately S5-65 % of peripheral T lymphocytes, whereas the T8 (CDS) antigen is expressed on 20-30 %. These 
two subpopulations correspond to helper and to suppressor and cytotoxic T celts, respectively 

IS In addition to providing a convenient means of distiniguishing T lymphocyte subpopulations, these cell surface 

antigens are Important for mature T cell activation and effector function. T cell activation involves a complex series of 
cell surface interactions between the T cell and the target cell or stimulator cell in addition to binding of the T cell 
receptor to its specific antigen. 

For example, CD2, the human T cell erythrocyte receptor, allows thymocytes and T-lymphocytes to adhere to 

20 target cells (e.g. erythroyctes) and to thymic epithelium. This occurs via a specific molecular ligand for CD2, designated 
LFA-3, In humans, which is a widely distributed surface antigen. This phenomenon has long been employed to detect, 
assay and purify hurnan cells producing antibodies to sheep erythrocytes and serves as the basis for the E-rosette 
test, first decribed by Zaalberg, Nature, 202:1231 (1964). CD2/LFA-3 interactions also have been shown to mediate 
cytolytic target conjugation (Shaw et al., Nature, 323:262-264 (1986), and the mixed lymphocyte reaction (Martin et 

25 al., J. Immunol.. 1 31 :180-185 (1 983). Antl-CD2 nrK>noclonat antibodies can directly activate peripheral T-lymphocytes 
via an antigen-independent pathway (Meuer et al.. Cell, 36:897-906 (1 984), indicating an even wider Immunoregulatory 
role for CD2. 

Recognition that T lymphocytes are the main effectors of cell-mediated immunity and also are involved as helper 
or suppressor cells in modulating the immune response has resulted In a significant contribution to the increasing 

30 practical application of clinical immunology to medicine. The scope of this application includes defense against infec- 
tions, prevention of diseases by immunization, organ transplantations, blood banking, deficiencies of the immune sys- 
tem, and a variety of disorders that are mediated by immunologic mechanisms. Moreover, immunologic techniques 
frequently are used In the clinical laboratory, as In the measurement of hormones and drugs. Clinical immunology is 
described, for example, In Weir. ed. Handbood of Experimental Immunology in Four Volume 4: Application of Immu- 

35 nological Methods in Biomedical Sciences. 4th Ed., Blackwell Scientific Publications, Oxford (1986); Boguslaski et al, 
eds, Clinical Immunochmistry: Principles of Methods and Applications, Little, Brown & Co., Boston (1984); Holborow 
et all, eds. Immunology in Medicine: A Comprehensive Guide to Clinical Immunology 2d Ed., Grune & Stratton, London 
(1983); and Petersdorf et al., eds., Harrison's Principles of Internal Medicine, 10th ed.. McGraw-Hill, New York, pub- 
lisher, pp. 344-391 (1963). Clearly a more through understanding of the proteins which mediate the immune system 

40 wouki be of significant value in clinical immunology 

Use of mammalian expression libraries to isolate cDNAs encoding mammalian proteins such as those described 
above would offer several advantages. For example, the protein expressed in a mammalian host cell should be func- 
tional and should undergo any normal posttranslational modification. A protein ordinarily transported through the in- 
tracellular membrane system to the cell surface should undergo the complete transport process. A mammalian ex- 

45 pression system also would allow the study of intracellular transport mechanism and of the mechanism that insert and 
anchor cell surface proteins to membranes. 

One common mammalian host cell, called a "COS' cell, is formed by infecting monkey cells with a mutant viral 
vector, designated simian virus strain 40 (SV40), which has functional early and late genes, but lacks a functional origin 
of replication. In COS cells, any foreign DNA cloned on a vector containing the SV40 origin of replication will replicate 

50 because SV40 T antigen is present in COS cells. The foreign DNA will replicate transiently independently of the cellular 
DNA. 

With the exception of some recent lymphokine cDNAs isolated by expression in COS cells (Wong, G.G., et al., 
Science, 228:810-815 (1985); Lee, F et al.. Proceedings of the National Academcy of Sciences, USA. 83:2061-2065 
(1986); Yokota T, et al.. Proceedings of the National Academy of Sciences. USA, 83:5894-5898 (1986); Yang, Y, et 
55 all, Cell. 47:3-10 (1986)), however, few cDNAs in general are isolated from mammalian expression libaries. There 
appear to be two principal reasons for this: First, the existing technology (Okayama. H. et al., Mol. Cell. Biol. , 2: 1 61 -1 70 
(1 982)) for construction of large plasmid libraries is difficult to master, and library size rarely approaches that accessible 
by phage cloning techniques. (Huynh. T et al.. In.: DNA ck>ning Vol. I. A Practical Approach, Glover. D M. (ed.) IRL 
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Press, Oxford (1985), pp. 49-78). Second, the existing vectors are, with one exception (Wong, G.G. et al. Science, 
228:810-815 (1985)), poorly adapted for high level expression, particulary in COS cells. The reported successes with 
lymphokine cDNAs do not imply a general fitness of the methods used, since these cDNAs are particularly easy to 
isolate from expression libraries. Lymphokine bioassays are very sensitive (Wong, G.G., et al., Science, 228:810-815 

5 (1985); Lee, F. et al., Proceedings of the National Academy of Sciences, USA, 83:2061-2065 (1986); Yokota, T. et al., 
Proceedings for the National Academy of Sciencecs. USA, 83:5894-5898 (1986); Yand, Y et al., Cell; 47:3-10 (1986)) 
and the mRNAs are typically both abundant and short (Wong, G.G. et al., Science, 228:810-815 (1985); Lee, F. et al.. 
Proceedings of the National Academy of Sciences, USA, 83:2061-2065 (1986); Yokota T, et al.. Proceedings of the 
National Academy of Sciences, USA. 83:5894-5898 (1986); Yand. Y et al.. Cell, 47:3-10 (1 986)). 

10 Thus, expression in mammalian hosts previously has been most frequently employed solely as a means of verifying 

the identity of the protein encoded by a gene isolated by more traditional cloning methods. For example, Stuve et al., 
J. Virol., 61(2):327-335 (1987), cloned the gene for glycoprotein gB2 of herpes simplex type II strain 333 by plaque 
hybridization of M1 3-based recombinant phage vectors used to transform competent E. coli JM101 . The identity of the 
protein encoded by the clone thus isolated was verified by transfection of mammalian COS and Chinese hamster ovary 

IS (CHO) cells. Expression was demonstrated by immunofluorescence and radioimmunopreclpitation. 

Oshima et al., used plaque hybridization to screen a phage lamda gtl 1 cDNA library for the gene encoding human 
placental beta-glucuronidase. Oshima et al., Proceedings of the National Academy of Sciences, U.S.A., 84:685-689 
(1987). The identity of isolated cDNA clonses was verified by immunoprecipitation of the protein expressed by COS- 
7 cells transfected with cloned inserts using the SV40 late promoter. 

20 Transient expression in mammalian cells has been employed as a means of confirming the identity of genes pre- 

viously isolated by other screening methods. Gerald et al.. Journal of General Virology, 67:2695-2703 (1986). Mac- 
kenzie, Journal of Biological Chemistry, 261:14112-14117 (1986); Self et al., Gene, 43:1111-1121 (1986); Orkin el al.. 
Molecular and Cellular Biology, 5(4):762-767 (1985). These methods often are inefficient and tedious and require 
multiple rounds of screening to identify full-length or overlapping clones. Prior screening methods based upon expres- 

2S sion of fusion proteins are ineffecient and require large quantities of monoclonal antibodies. Such drawbacks are com- 
pounded by use of inefficient expression vectors, which result in protein expression levels that are inadequate to enable 
efficient selection. 

Summary of the invention 

30 

By means of the cloning method of the present invention, isolation and molecular cloning of the cell surface antigen 
GDI 9 has been accomplished. The nucleotkie sequence of the gene ctoned by the method of the present invention 
has been determined and the amino acid sequence of the encoded protein has been identified. The cloned gene is 
also the subject of the present invention. Once the gene encoding an antigen has been cloned according to the method 

35 of the present invention, that gene can be expressed in a prokaryotic or a eukaryotic host cell to produce the encoded 
protein or portion thereof in substantially pure form such as it does not exist in nature. Another aspect of the present 
invention relates to the substantially pure cell surface antigen. The primary amino acid sequences of the CD1 9 antigen 
has been detemined. The invention thus also relates to the amino acid sequences of the antigen and to the nucleotide 
. sequences encoding the antigen. The purified gene and proteins and protein of the present inventfon is useful for 

40 immunodiagnostic and immunotherapeutic applications, including the diagnosis and treatment of immune-mediated 
infections, diseases, and disorders in animals, including humans. It can also be used to kientify, isolate and purify other 
antibodies and antigens. Such diagnostic and therapeutic uses comprise yet another aspect of the present invention. 
Moreover, the substantially pure protein of the present invention may be prepared as medicament or pharmaceutical 
composition for therapeutic administration. The present invention further relates to such medicaments and composi- 

^5 tions. 

Brief Description of the Drawings 

Figure 1 . Nucleotide sequenqe of expression vector piH3 

so Nucleotides 1 -589 are derived from pMBI origin (pBR322 ori); nucleotides 590-597 are derived from the Sacll 

linker (ACCGCGT); nucleotides 598-799 are derived from the synthetic tyrosine suppressor tRNA gene (supF 
gene); nucleotides 800-947 are derived from a remnant of the ASV LTR fragment (Pvull to Mlul); nucleotides 
948-1500 are derived from the human cytomegalovirus AD 169 enhancer; nucleotides 1501 -1650 are derived from 
HIV TATA and tat-responsive elements; nucleotides 1651-1761 are derived from pILNXAN polylinker (Hindlll to 

ss XBa); nucleotides 171 7-2569 are derived from pSV to splice and poly-Addition signals; nucleotides 2570-29 17 are 

derived from the SV40 origin of replication (pvull to (Hindlll); and nucleotides 2918-2922 are derived from piVX. 
remnant of R1 site from polylinker. 
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Figure 3. Restriction map of the CDM8 expression vector 

The CDM8 vector includes a deleted version of a mutant polyoma virus early region selected for high efficiency 
expression in both murine and monkey cells. Substantially all of the human immunodefficiency promoter region 
has been replaced with the cognate sequences of the human cytomegalovirus immediate early promoter, and by 
5 inclusion of a bacteriophage T7 promoter between the eukaryotic promoter and the site of cDNA insertion. Arrows 

indicate the direction of transcription. 

Figure 5. Restriction hAap of the piH3M vector 

The direction of transcription is indicated by an arrow. Restriction endonuclease sites flanking the BstX1 cloning 
10 sites are shown. 

Figure 6. Nucleotide sequence of the piH3M vector 

There are 7 segments. Residues 1-587 are from the pBR322 origin of replication, 588-1182 from the Ml 3 
origin, 1 1 83-1 384 from the supF gene, 1 385-2238 are from the chimeric cytomegalovirus/human immunodeficiency 
IS virus promoter, 2239-2647 are from the replaceable fragment, 2648-3547 from plasmid pSV2 (splice and polya- 

denylation signals), and 3548-3900 from the SV40 virus origin. 

Figure 12. Nucleotide sequence of CD19 

20 Detailed Description of the Invention 

This invention relates to a novel method for cloning cDNA encoding a cell surface antigen and to a method of 
constructing cDNA libraries. It also relates to particular cDNA expression vectors and components thereof, nucleotide 
sequences or genes isolated by the method, substantially pure cell surface antigens encoded by the cDNA segments, 
2S and methods of using the isolated nucleotide sequences and encoded products. 

In the following description, reference will be made to various methodologies known to those of skill in the art of re- 
combinant genetics. Publications and other materials setting forth such known methodologies to which reference is 
made are incorporated herein by reference in their entireties. 

Standard reference works setting forth the general principles of recombinant DNA technology include Darnell, J.E. et 
30 al., Molecular Cell Biology, Scientific American Books, Inc., publisher, New York, N.Y. (1986); Lewin, B.M., Genes II, 
John Wiley & Sons, publisher. New York, N.Y. (1985); Old, R.W. et al., Principles of Gene Manipulation: An Introduction 
to Genetic Engineering, 2nd edition. University of California Press, Berkeley, CA (1961); and Maniatis, T et al.. Mo- 
lecular Cloning: A Laboratory Manual, Cold Spring Harbor, NY (1982). 

By "cbning" is meant the use of in vitro recombination techniques to Insert a particular gene or other DNA sequence 
3S into a vector molecule. In order to successfully clone a desired gene, it is necessary to employ methods for generating 
DNA fragments, for joining the fragments to vector molecules, for introducing the composite DNA molecule into a host 
cell in which it can replicate, and for selecting the clone having the target gene from amongst the recipient host cells. 
By "cDNA" is meant complementary or copy DNA produced from an RNA template by the action of RNA-dependent 
DNA polymerase (reverse transcriptase). Thus a 'cDNA clone" means a duplex DNA sequence domplementary to an 
40 RNA molecule of interest, carried in a cloning vector. 

By "cDNA library" is meant a collection of recombinant DNA molecules containing cDNA inserts which together com- 
prise the entire genome of an organism. Such a cDNA library may be prepared by art-recognized methods described, 
for example, in Maniatis et al.. Molecular Cloning: A Laboratory Manual supra. Generally, RNA is first isolated from the 
cells of an organism from whose genome it is desired to clone a particular gene. Preferred for the purposes of the 
45 present inventioin are mammalian, and particularly human, cell lines. More preferred are the human tumor cell, derived 
from an animal tumor, and preferably from a human tumor. Thus, a library may be prepared from, for example, a human 
adrenal tumor, but any tumor may be used. 

The immunoselectbn cloning method of the present invention comprises the preparation of a cDNA library by extracting 
total RNA including a particular gene from a cell, synthesizing a series of complementary double-stranded cDNA frag- 
so ments from the RNA and introducing these cDNA fragments into mammalian cells in tissue culture. The mammalian 

cells are maintained under conditions which allow them to express the protein (i.e. the cell surface antigen). The re- 
sulting cells are exposed to a first antibody or pool (group) of antibodies directed against the cell surface antigen. This 
results in information of a cell surface antigen-first antibody complex. The complexes are exposed to a substrate to 
which is coated or bound a second antibody directed against the first antibody. Cells expressing the cell surface antigen 
ss adhere to the substrate (because of fomnation of a cell surface antigen-first antibody-second antibody complex). Ad- 
herent cells are separated from nonadherent cells. 
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Isolation of total RNA 

The guanidium thIocyanate/CsCI method of isolating total RNA is preferred. More preferred Is a guanidlum thio- 
cyanate/LICI variant of the GuSCN/CsCI ntethod, which has added capacity and speed. Briefly, for each ml of mix 

5 desired, 0.5 g GuSCN are dissolved in 0.58 ml of 25 % LiCI (stock filtered through 0.45 micron filter) and 20 \i\ of 
mercaptoethanol is added. Cells are spun out and the pellet is dispersed on walls by flicking, add 1 ml of solution to 
up to 5 X 10^ cells. The resulting combinaton is sheared by polytron until nonviscous. For small scale preps (less than 
10® cells) layer 2 ml of sheared mix on 1 .5 ml of 5.7 M CsCI (RNase free; 1 .26 g CsCI added to every ml 10 mM EDTA 
pH 8), overlay with RNase-free water and spin SW55 50 krpm 2 h. For large scale preps, layer 25 ml on 12 ml CsCI 

10 in a SW 28 tube, overlay, and spin 24k rpm 6 h. Aspirate contents carefully with a sterile pasteur pipet connected to a 
vacuum flask. Once past the CsCI interface, scratch a band around the tube with the pipet tip to prevent the layer on 
the wall of the tube from creeping down. The remaining CsCI solution is aspirated. The pellets are taken up in water 
(do not try to redissolve). 1/1 0 vol. NaOAc and 3 vol. EtOH are added and the resulting combination is spun. If necessary, 
the pellet is resuspended in water (e.g., at 70°C). Adjust concentration to 1 mg/ml and freeze. Small RNA (e.g. 5S) 

IS does not come down. For small amounts of cells, scale down volumes and overlay GuSCN with RNase-free water on 
gradient (precipitation is inefficient when RNA is dilute). 

Preparation of poly A+ RNA 

20 Next. polyA* RNA may be prepared, preferably by the oligo dT selection method. Briefly, a disposable polypropyl- 

ene column is prepared by washing with 5M NaOH and then rinsing with RNase-free water. For each milligram total 
RNA about 0.3 ml (final packed bed) ollgo dT cellulose Is used. Oligo dT cellulose is prepared by resuspending about 
0,5 ml of dry powder in 1 ml of O.lfvl NaOH and transferring it into the column, or by percolating 0.1 NaOH through a 
previously used column (columns can be reused many times). This is washed with several column volumes of RNase- 

2S free water, until plH is neutral, and rinsed with 2-3 ml of loading buffer. The column bed is then removed Into a sterile 
15 ml tube using 4-6 ml of loading buffer. The total RNA to TO^C for 2-3 min., LiCl from RNase-free stock is added (to 
0.5M), and combined with oligo dT cellulose in a 15 ml tube. This is followed by vortexing or agitation for 10 min. The 
result is poured into a column and washed with 3 ml loading buffer and then 3 ml of middle wash buffer. mRNA Is eluted 
directly into an SW55 tube with 1 .5 ml of 2mM EDTA, 0.1 % SDS; the first two or three drops are discarded. 

30 Eluted mRNA is precipitated by adding 1/10 vol. 3M NaOAc and filling the tube with EtOH. This is then mixed, chilled 
for 30 minutes at -20*C, and spun at 50k rpm at 5*C for 30 min. The EtOH is poured off and the tube is air dried. The 
mRNA pellet is resuspended in 50-100 \i\ of RNase-free water. Approximately 5 ^1 is melted at 70"C in MOPS/EDTA/ 
formaldehyde and run on an RNase-free 1 % agarose gel to check quality. 

3S cDNA Synthesis 

From this, cDNA is synthesized. A preferred method of cDNA synthesis is a variant of that described by Gubler 
and Hoffman (Gene 25: 263-269 (1962)). This is carried out as follows: 

40 a. First Strand. 4 ^g of mRNA and heated to about 100^C in a microfuge tube for 30 seconds and quenched on 

Ice. The volume is adjusted to 70 \i\ with RNase-free water. The following are added: 20 ^1 of RT1 buffer. 2 ^1 of 

RNase inhibitor (Boehrlnger 36 1 ^l of 5 [lq/\l\ of oligo dT (Collaborative Research), 2.5 )il of 20mM dXTP's 
(uitrapure), 1 ^1 of 1 M DTT and 4 \i\ of RT-LX (Life Science, 24 jj/jil). The resulting combination is incubated at 
42°C for 40 min. It is heated to inactivate (70*C 10 min). 

45 

b. Second Strand. 320 ^1 of RNase free water, 80 ^1 of RT2 buffer, 5 ^1 of DNA Polymerase I (Boehringer, 5 U/jil), 
2 ^1 RNase H (BRL 2 p/^il). Incubate at 15°C for 1 h and 22*'C for 1 h. Add 20 \i\ of 0.5 M EDTA pH 8.0, phenol 
extract and EtOH precipitate by adding NaCI to 0.5 M, linear polyacrylamide (carrier) to 20 ^g/ml, and filling tube, 
with EtOH. Spin 2-3 minutes in microfuge, remove, vortex to dislodge precipitate high up on wall of tube, and respin 

so 1 minute. 

c. Adaptors. Resuspend precipitated cDNAin 240 ^1 of TE (10/1). Add 30 |il of 10 X low salt buffer, 30 ^1 of 10 X 
ligation additions, 3 jil (2.4 \ig) of kinased 12-mer adaptor, 2 |il (1 .6 jig) of kinased 8-mer adaptor, and 1 \i\ of T4 
DNa ligase (BioLabs), 400 or Boehringer, 1 Weiss unit/ml). Incubate at 15*C overnight. Phenol extract and 

ss EtOH precipitate as above (no extra carrier now needed), and resuspended in 100 ^1 of TE. 
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Use of cDNA f ragements In expression vectors 

For use with the BstXI-based cDNA expression vectors of the invention, (see infra) oligonucleotide segments 
containing terminal sequences corresponding to BstXI sites on the vectors are ligated to the cDNA fragment desired 

5 to be inserted. The resulting fragments are pooled by fractionation. A preferred method is as follows: 

Prepare a 20 % KOAc. 2 mM EDTA, 1 ^l/ml EthBr solution and a 5 % KOAc, 2 mM EDTA. 1 ^g/ml EthBr solution. 
Add 2.6 ml of 20 %KOAc solution to back chamber of a small gradient maker. Remove air bubble from tube connecting 
the two chambers by allowing solution to flow into the front chamber and then tilt back. Close passage between cham- 
bers, and add 2.5 ml of the 5 % solution to run Just to the end of the tubing, and then return to chamber. Place the 

10 apparatus on a stirplate, set the stir bar moving as fast as possible, open the stopcock connecting the two chambers 
and then open the front stopcock. Fill a polyallomer SW55 tube from the bottom with the KOAc solution. Overlay the 
gradient with 100 \i\ of cDNA solution. Prepare a balance tube and spin the gradient for 3 hrs at 50 k rpm at 22°C. To 
collect fractions from the gradient, pierce the SW55 tube with a butterfly infusion set (with the luer hub clipped off) 
close to the bottom of the tube and collect three 0.5 ml fractions and then 6 0.25 ml fractions Into microfuge tubes 

15 (about 22 and 11 drops respectively). EtOH precipitate the fractions by adding linear polyacrylamide to 20 ^g/ml and 
filling the tube to the top with EtOH. After cooling tubes, spin them in a microfuge for 3 min. Vortex and respin 1 min. 
Rinse pellets with 70 % EtOH (respin). Do not dry to completion. Resuspend each 0.25 ml fraction in 10 ^1 of TE. Run 
1 ^l on 1 % agarose minigel. Pool the first three fractions, and those of the last six which contain no material smaller 
than 1 kb. 

20 Suppressor tRNA plasmlds may be propagated by known methods. In a preferred method according to the present 
invention, supF plasmids can be selected in nonsuppressing hosts containing a second plasmid, p3, which contains 
amber mutated ampiclllln and tetracycline drug resistance elements (Seed, 1 983). The p3 plasmid is derived from PR1 , 
is 57 kb In length, and Is a stably maintained, single copy episome. The ampiclllln resistance of this plasmid reverts at 
a high rate, so that amp^ plasmlds usually cannot e used in p3-containing strains. Selection for tet resistance abne is 

25 almost as good as selection for ammp+tet resistance. However, spontaneous appearance of chromosomal suppressor 
tRNA mutations presents an unavoidable background (frequency about 10*^) suppressor mutations are usually bigger 
than colonies arising from plasmid transformation. Suppressor plasmids typically are selected for in LB medium con- 
taining amp at 1 2.5 |ig/ml and tet at 7.5 ^g/ml. For large plasmid preps, M9 casamino acids medium containing glycerol 
(0.8 %) may be used as a carbon source, and the bacteria grown to saturation. P 

30 Vector DNA may be isolated by known methods. The following method is preferred for plasmid from 1 liter of 

saturated cells: 

Spin down cells in 1 liter J6 bottles, 4.2k rpm, 25 minutes. Resuspend in 40 ml 10 mM EDTA pH 8 (Thump on soft 
surface). Add 80 ml 0.2 M NaOH. 1 % SDS. swirl until clearish, viscous. Add 40 ml 5 M KOAc, pH 4.7 (2.5 M KOAc. 
2.5 M HOAc) shake (same bottle) 4.2 rpm, 5 min. Pour supernatant through chesecloth into 250 ml bottle. Fill bottle 

3S with Isopropyl alcohol. Spin J6, 4.2 k rpm, 5 min. Drain bottle, rinse gently with 70 % EtOH (avoid fragmenting the 
pellet). Invert bottle, and remove traces of EtOH with KImwIpe. Resuspend in 3.5 ml Tris base/EDTA 20 mM/ 10 mM. 
Add 3.75 ml of resuspended pellet to 4.5 g CsCl. Add 0.75 ml 10/mg/ml ethidium bromide, mix. Fill VTi80 tubes with 
solution. Run at a speed of 80 rpm for 2.5 hours or longer. Extract bands by visible light with 1 ml syringe and 20 gauge 
or lower needle. Cut top off tube, insert the needle upwards Into the tube at an angle of about 30'*C with respect to the 

<o tube. (i.e.. as shallowty possible) at a position about 3 mm beneath the band, with the bevel of the needle up. After the 
band is removed, pour tube contents into bleach. Deposit extracted bands in 1 3 ml Sarstedt tube. Fill tube to top with 
n-butanol saturated with 1 M NaCI, extract. If a very large quantity of DNA is obtained, reextract. Aspirate butanol into 
trap containing 5M NaOH (to destroy ethidium). Add about equal volume 1 M ammonium acetate to DNA (squirt bottle). 
Add about 2 volumes 95 % ethanol (squirt bottle). Spin 10K rpm, 5 min. J2-21 . Rinse pellet carefully with 70 % ethanol. 

45 Dry with swab, or lyophillzer. The vector may be prepared for cloning by known methods. A preferred method begins 
with cutting 20 |xg of vector In a 200 reaction with 100 units of BstXI (New York Biolabs), cutting at 50*C overnight 
in a well-thermostatted water bath (i. e., circulating water bath). Prepare 2 KOAc 5-20 % gradients in SW55 tubes as 
described above. Add 100 \i\ of the digested vector to each tube and run for 3 hrs, 50K rpm at 22°C. Examine the tube 
under 300 nm UV light. The desired band will have migrated 2/3 of the length of the tube. Fonward trailing of the band 

50 means the gradient is overloaded. Remove the band with a 1 ml syringe and 20 gauge needle. Add linear polyacrylamide 
and precipitate the plasmid by adding 3 volumes of EtOH. Resuspend in 50 ^l of TE. Set up ligations using a constant 
amount of vector and Increasing amounts of cDNAs. On the basis of these trial ligations, set up large scale ligation, 
which can be accomplished by known methods. Usually the entire cDNA prep requires 1-2 |ig of cut vector. 
Adaptors may be prepared by known methods, but it is preferred to resuspend crude adaptors at a concentration of 1 

55 ^g/jil, add MgS04 to 10 mM, and precipitate by adding 5 volumes of EtOH. Rinse with 70 % EtOH and resuspend In 
TE at a concentration of 1 |ig/MJ- To kinasing buffer and 20 units of kinase; incubate 37''C overnight. 
Preparation of buffers mentioned in the above description of preferred methods according to the present invention will 
be evident to those of skill. For convenience, preferred buffer compositions are as follows: 
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Loading Buffer: 

Middle Wash Buffer: 

RT1 Buffer: 

RT2 Buffer': 

10X Low Salt: 

10X Ligation Additions: 

10X Kinasing Buffer: 



0.5 M LiCI, TO mM Tris pH 7.5, 1 mM EDTA 0.1 % SDS. 

0.15 M LiCI, 10 mM Tris pH 7.5, 1 mM EDTA 0.1 % SDS. 

0.25 M Tris pH 8.8 (8.2 at 42'»C), 0.25 M KC1, 30 mM MgClg. 

0.1 M Tris pH 7.5, 25 mM MgClg, 0.5 M KCl. 0.25 mg/ml BSA, 50 mM DTT 

60 mM Tris pH 7.5", 60 mM MgClg, 50 mM NaCI. mM NaCI, 2.5 mg/ml BSA 70 mM Me. 

1 mM ATP, 20 mM DTT, 1 mg/ml BSA 10 mM spermidine 

0.5 M Tris pH 7.5, 10 mM ATP, 20 mM DTT, 10 mM spermidine, 1 mg/ml BSA lOOmM MgClg. 



By "vector" is meant a DNA molecule, derived from a plasmid or bacteriophage, into which fragments of DNA may 
10 be inserted or cloned. A vector will contain one or more unique restrictioin sites, and may be capable of autonomous 
replication in a defined host or vehicle organism such that the cloned sequence is reproducible. Thus, by "DNA ex- 
pression vector" is meant any autonomous element capable of replicating in a host Independently of the host's chro- 
mosome, after additional sequences of DNA have been incorporated into the autonomous element's genome. Such 
DNA expression vectors include bacterial plasmids and phages. 
IS Preferred for the purpose of the present Invention, however, are viral vectors, such as those derived from simian 

virus strain 40 (SV40). SV40 is a papovavirus having a molecular weight of 28 Mdal, and confining a circular double- 
stranded DNA molecule having a molecular weight of 3 Mdal, which comprises the entire genome of the virus. The 
entire nucleotide sequence of this single, small, covalently closed circular DNA molecule has been determined. Piers 
etal.. Nature 273: 113-120(1976); Reddyetal.. Science 200: 494-502(1978). The viral DNA of SV40 may be obtained 
20 in large quantities, and the genomic regions responsible for various viral functions have been accurately located with 
respect to a detailed physical map of the DNA. Flers et al.. supra; Reddy et al., supra. The viral genome of SV40 can 
multiply vegetatively or as an integral part of cellular chromosomes, and a wealth of information exists on the replication 
and expression of this genome. 

Also preferred for the purposes of the present invention is a single-stranded bacteriophage cloning vehicle, designated 

2S Ml 3, having a closed circular DNA genome of approximately 6.5 kg. An advantage of utilizing Ml 3 as a cloning vehicle 
is that the phage particles released from infected cells contain singte-stranded DNA homologous to only one of the 
two complementary strands of the cloned DNA. which therefore can be used as a template for DNA sequencing analysis. 
Even more preferred for the purposes of the present invention are the expression vectors designated piH3, piH3M, 
and CDM8, deposited at the American Type Culture Collection (ATCC), 1 2301 Parklawn Drive, Rockville, MD 20852, 

30 on February 24, 1 988. in E. coli. piH3 has accession number ATCC 67,634, piH3M has accession number ATCC 67.633 
and CDM8 has accesston number ATCC 67,635. 

By "tissue culture" is meant the maintance or growth of animal tissue cells in vrtro so as to allow further differentiation 
and preservatkxi of cell architecture of function or both. "Primary tissue cells" are those taken directly from a population 
consisting of cells of the same kind performing the same functioln In an organism. Treating such tissue cells with the 

3S proteolytic enzyme trypsin , for example, dissociates them into individual primary tissue cells that grow well when seeded 
onto culture plates at high densities. Cell cultures arising from multiplication of primary cells In tissue culture are called 
'secondary cell cultures." Most secondary cells divide a finite number of times and then die. A few secondary cells, 
however, may pass through this 'crisis period", after which they are able to multiply indefinitely to form a continous 
"cell line". Celt lines often will contain extra chromosomes, and usually are abnormal in other respects as well. The 

40 immortality of these cells is a feature shared in comon with cancer cells. 

Preferred celt tines for use as tissue culture cells according to the present Invention include the monkey kidney cell 
line, designated "COS'. COS cells are those that have been transformed by SV40 DNA containing a functional early 
gene region but a defectie origin of viral DNA replication. COS cell clone M6 is particularly preferred for use according 
to the method of the Invention. Also preferred for the purposes of the present invention are murine "WOP" cells, which 

45 are NIH 3T3 cells transfected with polyoma origin deletion DNA. cDNA may be introduced into the host tissue culture 
cells of the present invention by any methods known to those of skill. Transfection may accomplished by, for example, 
protoplast fusion by spheroplast fusion, or by the DEAE dextran method (Sussman et al., Cell. Biol. 4: 1641-1643) 
(1984)). 

If spheroplast fusion Is employed, a preferred method is the.foltowing variant based on Sandri-Goldrin et al., Mol. 

so Cell Bio. 1: 743-752 (1981). Briefly, for example, a set of six fusions requires 100 ml of cells in broth. Grow celts 
containing amplrfiable plasmid to OD600 = 0.5 in LB, Add spectinomycin to 100 ^ig/ml (or chloramphenicol to 1 50 ^g/ 
ml). Continue incubation at 37*C with shaking for 10-16 hours. (Cells begin to lyse with prolonged Incubation in spec- 
tinomycin or chloramphenicol medium). Spin down 100 ml of culture (JA14/GSA rotor. 250 ml bottle) 5 min. at 10,000 
rpm. Drain well, rersuspend pellet in bottle with 5 ml cold 20 % sucrose, 50 mM Tris-HCI pH 8.0. Incubate on ice 5 

55 min. Add 2 ml cold 0.25 EDTA pH 8.0, incubate 5 min. at 37°C (wateibath). Place on tee, check percent conversion to 
spheroplasts by microscopy. In flow hood, slowly add 20 ml of cold DME/10 % sucrose/10 mM MgCt2 (dropwise, ca. 
2 drops per second). Remove media from cells plated the day before in 6 cm dishes (50 % confluent). Add 5 ml of 
spheroplast suspension to each dish. Place dishes on top of tube carriers in swinging bucket centrifuge. Up to 6 dishes 
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can be comfortably prepared at once. Dishes can be stacked on top of each other, but 3 in a stack is not advisable as 
the spheroplast layer on the top dish is often torn or detached after centrifugation. Spin at 1000 xg 10 min. Force is 
calculated on the basis of the radius to the bottom plate. Asirate fluid from dishes carefully. PIpet 1 .5-2 ml 50 % (wAv) 
PEG1450 (or PEG1000)/50 % DME (no serum) into the center of the dish. If necessary, sweep the pipet tip around to 

5 ensure that the PEG spreads evenly and radially across the whole dish. After PEG has been added to the last dish, 
prop all of the dishes up on their lids so that the PEG solution collects at the bottom. Aspirate the PEG. The thin layer 
of PEG that remains on the cells is sufficient to promote fusion; the layer remaining is easier to wash off, and better 
cell viability can be obtained, than if the bulk of the PEG is left behind. After 90 to 120 seconds (PEG 1000) or 120 to 
150 seconds (PEG 1450) of contact with the PEG solution, pipet 1.5 ml of DME (no serum) into the center of the dish. 

10 The PEG layer will be swept radially by the DME. Tilt the dishes and aspirate. Repeat the DME wash. Add 3 ml of 
DME/10 % serum containing 15 ^g/ml gentamicin sulfate. Incubate 4-6 hours in incubator. Remove media and remain* 
ing bacterial suspension, add more media and incubate 2-3 days. Extensive washing of the cell layer to remove PEG 
tends to remove many of teh cells without any substantial benefit. If the cells are allowed to sit in the second DME 
wash for a few minutes, most of the spheroplast layer will come up spontaneously; however, it is preferred to wash 

IS briefly and allow the layer to come off in the complete medium at 37" C. 

The PEG solution can be conveniently prepared by melting a fresh bottle of PEG at 60"C and pouring approximate 50 
ml aliquots by means of a 50 ml centrifuge tube into preweighed bottles. The aliquoted PEG is stored at 5 "C in the 
dark. To make up a fresh bottle, weigh the aliquot, remelt, and add an equal volume of DME (no serum). Adjust the 
pH with 7.5 % Na Bicarbonate solution if necessary, and filter sterilize. The resulting PEG solutbn may be stores up 

20 to 3 months at room temperature without detectable adverse consequence. 

Transfected host cells will be cultures according to the invention in order to accomplish expression of the protein 
encoded by the cDNA clone, and to increase the absolute numbers of cells available for subsequent immunoselection. 
Those skilled in the art will know of appropriate methods and media for this purpose, taking into account the cell type 
and other variables routinely considered. COS cells, for example, may be cultured In Dulbecco's modified Eagle's 

2S medium (DME) supplemented with 10 % calf serum and gentamycin sulfate. Transient expression of transfected cells 
normally can be expected between 48 and 72 hours posttransfectlon. However, this time perkxl may vary depending 
upon the type or strain of host cell used and the cell culture conditbns, as will be apparent to those of ordinary skill. 
Immunoprecipitation, blotting, and cDNA sequencing of genes clones according to the methods of the present invention 
may be carried out by any convenient methods known to those of skill. For examie, the immunoprecipitation protocol 

30 of Clark et al., Leukocyte Typing II, Vol. 11, pp. 155-167 (1986), is preferred. Southern, Northern, or other blot analysis 
methods known to those of skill may be employed, using hybridization probes prepared by known methods, such as 
that of Hu et al. (Gene 1 6: 27 1 -277 (1 962)). cDNA sequencing also may be accomplished by known methods. Including 
the dideoxynucleotide methkod of Sanger et al., PN.A.S. (USA) 74: 5463-5467 (1977). 

The antibodies used according to the present invention may be polyclonal or monoclonal. These may be used singly 
35 or in conjunctioin with other polyclonal or monoclonal antibodies to effect immunoselection of cells expressing the 
desired antigen or antigens by the methods of the present invention. Methods of preparing antibodies or fragments 
thereof for use according to the present invention are known to those of skill. 

Standard reference works setting forth general principles of immunology include Klein, J., Immunology: The Science 
of Self-Nonself Discrimination, John Wiley & Sons, publisher, New York (1982); Kennett, R., et al. eds.. Laboratory 

^0 Techniques in Biochemistry and Molecular Biology, Vol. 13, Elsevere, publisher, Amsterdam (1984). 

The term "antibody" is meant to include the intact molecule as well as fragments thereof, such as, for example, Fab 
and (F(ab)'2 fragments; which also are able to bind to antigen. Polyclonal antibody preparations may be derived directly 
from the blood of the desired aninrtal species after immunization with the antigen of interest, or a fragment thereof, 
using any of the standard protocols known to those of ordinary skill. Similarly monoclonal antibodies may be prepared 

45 using known methods (Kohler et al., Eur. J. Immunol. 6: 292 (1976)). Use of monoclonal antibodies is preferred for the 
purpose of the present invention. 

For the purpose of immunoselection according to the present invention, the tissue culture host cells which have been 
exposed to antibodies directed against the target cell surface antigen are separated from hsot cells which do not 
express the target antigen by distributing the cells onto a substrate coated with antibody driected against the antibody 
so for the antigen. This technique, termed "panning", will be known to those of skill, and is described, for example, by 
Mage et al., J. Immunol. Meth. 15: 47-56 (1977), and Wysocki and Sato, PN.A.S. (USA) 75: 2844-2848 (1978). 
Panning according to the methods of the present invention may be carried out as follows: 

a. Antibody-coated dishes. Bacteriological 600 mm plates, Falcon 1007 or equivalent, or 10 cm dishes such as 
65 Fischer 8-757-1 2 may be used. Sheep antimouse affinity purified antibody (from, for example, Cooper BioMedical 

(Cappell)) is diluted to 10 M.g/ml in 50 mM Tris HCI, pH 9.5. Add 3 ml per 6 cm dish, or 10 ml per 10 cm dish. Let 
sit ca. 1.5 hrs., rerrtove to next dish 1.5 hrs., then to 3rd dish. Wash plates 3 x with 0.15 NaCI (a wash bottle is 
convenient for this), incubate with 3 ml 1 mg/ml BSA in PBS overnight, aspirate and freeze. 



10 



7/11/2007, EAST Version: 2.0.3.0 



EP 0 739 980 A2 



b. Panning. Cells will be in 60 mm dishes. Aspirate medium from dish, add 2 ml PBS/D.5 mM EDTA/0.02 % azide 
and incubate dishes at 37**C for 30 min. to detach cells from dish. Triturate cells vigorously with short pasteur pipet, 
and collect cells from each dish rn a centrifuge tube. Spin 4 min. setting 2.5 (200 x g) (takes 5 min). Resuspend 
cells in 0.5 -1 ml PBS/EDTA/azide, layer carfully on 3 ml PBS/EDTA/a2ide/2 % Ficoll. and spin 4 min. at setting 

s . 2.5. Aspirate supernatant in one smooth movement. Take up cells In 0.5 ml PBS/EDTA/azide and add aliquots to 
antibody -coated dishes containing 3 ml PBS/EDT/\/azide/5 % FBS by pipetting through 100 micron Nylon mesh 
(Tetko). Add cells from at most two 60 mm dishes to one 60 mm antibody-coated plate. Let sit at room temperature 
1-3 hours. Remove excess cells not adhering to dish by gentle washing with PBS/5 % serum or with medium. 2 
or 3 washes of 3 ml are usually sufficient. 

10 

c. Hirt Supernatant. A preferred variant of the method of Hirt, J. Molec. Biol. 26: 365-369 (1 967), Is as follows: Add 
0.4 ml 0.6 % SDS, 1 0 mM EDTA to panned plate. Let sit 20 minutes (can be as little as 1 min. it there are practically 
no cells on the plate). Pipet viscous mixture into microfuge tube. Add 0.1 ml 5 M NaCI, mix, put on ice at least 5 
hrs. Keeping the mixture as cold as possible seems to improve the quality of the Hirt. Spin 4 min., remoe supernatant 

IS carefully, phenol extract (twice if the first interface is not clean), add 10 |ig linear polyacrylamide (or other carrier), 

fill tube to top with EtOH, precipitate, and resuspend in 0.1 ml. Add 3 volumes EtOH/NaOAc, reprecipitate and 
resuspend in 0.1 ml. Transform into MC 1061/p3, preferably unsing the high efficiency protocol hereinafter de- 
scribed. If the DNA volume exceeds 2 % of the competent cell aliquot, the transformatton efficiency will suffer. 5 
% gives the same number of colonies as 2.5 % (efficiency is halved). It is preferred for this aspect of the present 

20 invention to use 'blockers" in the incubation medium. Blockers assure that non-specific proteins, proteases, or 

antibodies present do not cross-link with or destroy the antibodies present on the substrate or on the host cell 
surface, to yield false positive or false negative results. Selection of blockers can substantially improve the spe- 
cificity of the immunoselection step of the present Invention. A number of non-specific monoclonal antibodies, for 
example, of the same class or subclass (isotype) as those used in the immnoselection step (e.g., IgG^, lgG2A, 

2S IgGm. etc.) can be used as blockers. Blocker concentratton (normally 1-100 \igf\i\) is important to maintain the 

proper sensitivity yet Inhibit unwanted Interference. Those of skill also will recognize that the buffer system used 
for incubation may be selected to optimize bbcking action and decrease non-specific binding. 

A population of cells to be panned for those expressing the target cell surface antigen is first detached from its cell 

30 culture dish (han/ested) without trypsin. The cells then are exposed to a first antibody which may be polyclonal or 
monoclonal, directed against the antigen of interest or against a family or related antigens. At this initial stage, a single 
antibody or a group of antibodies may be used, the choice depending upon the nature of the target antigen, its antic- 
ipated frequency, and other variables that will be apparent to those of skill. Target antigens expressed on the surfaces 
of host cells will form an antigen-antibody complex. 

3S The cells subsequently are placed in close apposition to a substrate, such as a culture dish, filter disc, or the like, which 
previously has been coated with a second antibody or group of antibodies. This second antibody will be directed against . 
the first antibody, and its choice will be a matter of ordinary skill dictated by for example, the animal in which the first 
antibody was raised. For example, if the first antibody was raised in mice, the second antibody might be directed against 
mouse immunoglobulins, raised in goats or sheep. Cells expressing the target antigen will adhere to the substrate via 

40 the complex formed between the antigen, the first antibody, and the second antibody. Adherent cells then may be 
separated from nonadherent cells by washing. DNA encoding the target antigen is prepared from adherent cells by 
known methods, such as that of Mirt, J. Molec. Biol. 26: 365-369 (1967). This DNA may be transfomned into E. coli or 
other suitable host cells for further rounds of fusion and selection, to achieve the desired degree of enrichment. In the 
usual case, the initial rounds of immunoselectioin will employ a panel of first antibodies directed against an epitope or 

4£ group of epitopes common to the family of antigens to which the target antigen belongs. This will be sufficient to narrow 
the number of clones for future rounds quite significantly Two such rounds usually will be found adequate, but the 
number of rounds may vary as mentioned above. Thereafter, a single round of selection may be performed employing 
a single first antibody or a group of first antibodies recognizing only the target antigen. By 'substrate" is meant a solid 
surface to which antibodies may be bound for immunoselection according to the present inventk>n. Known suitable 

so substrates include glass, polystyrene, polypropylene, dextran, nylon, and other materials. Tubes, beads, microtiter 
plates, bacteriological culture dishes, and the like formed from or coated with such materials may be used. Antibodies 
may be covalently or physically bound to the substrate by known techniques, such as covalent bonding via an amide 
or ester linkage, or by absorption. Those skilled in the art will know may other suitable substrates and methods for 
immobilizing antibodies thereupon, or will be able to ascertain such substrates and methods using no more than routine 

ss experimentation. 

The choice of host tissue culture cells for use according to the present invention preferably should be such as to avoid 
the situation in which the antibodies used fpr panning recognize determinants on untransfected cells. Thus, while COS 
cells are preferred for transient expressk>n of certain surfce antigens, more preferred are murine WOP cells. Of the 
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latter, WOP 3027 cells are even more preferred. WOP cells allow virtually all antibodies to be used, since cross-reac- 
tions between murine antibodies and murine cell surface determinants are rare. 

the insert size of the recombinant DNA molecule should be chosen to maximize the likelihood of obtaining an .entire 
coding sequence. Those of skill will know various methods by which a preliminary determination of optimal Insert size 
5 for a given gene may be determined. 

Vector construction and cDNA insertbn 

Vectors suitable for expressloin of cDNA in mammalian tissue culture cells may be constructed by known methods. 

10 Preferred for the purposes of the present invention is an expression vector containing the SV40 origin. The vector may 
contain a naturally derived or synthetic transcription origin, and the SV40 early region promoter. Even more preferred 
Is a chimeric promoter composed of human cytomegalovirus immediate early enhancer sequences. \^rious "enhancer 
sequences" also may be used with SV40 vectors. These are described, for example, by Banerji et al., Cell 27: 299-308 
(1981); Levinson et al., Nature 295: 568-572 (1982); and Conrad et a!., Mol. Cell. Biol. 2: 949-965 (1982). 

IS Insertion of DNA into the vectors of the present invention can occur, for example, by homopolymeric tailing with terminal 
transferase. However, homopolymeric tracts located 5' to cDNA inserts may inhibit In vitro and in vivo expression. Thus, 
preferred for purposes of the present invention is the use of inverted identical cleavage sites separated by a short 
replaceable DNA segment. Such inverted identical cleavage sites, preferably employing the BstXI restrictbn endonu- 
clease, may be used in parallel with cDNA synthetic oligonucleotides, giving the same tenminii as the replaceable 

20 segment of the vector. In this manner, the cDNA cannot ligate to itself, but can ligate to the vector. This allows the most 
efficient use of both cDNA and vector. 

Another embodiment of the present invention is the above-described efficient oligonucleotlde-based strategy to pro- 
mote cDNA insertion into the vector. The piH3M vector of the present invention is preferred, and employs the inverted 
endonuclease sites. This vector may contain an SV40 origin of replication, but a more preferred form contains an Ml 3 
25 origin. This vector, containing the M1 3 origin, allows high level expressbn in COS cells of coding sequences placed 
under Its control. Also, the small size and particular arrangement of sequences In the plasmid permit high level repli- 
cation in COS cells. 

By "cell surface antigen" is meant any protein that is transported through the intracellular membrane system to the cell 
surface. Such antigens normally are anchored to the cell surface membrane through a carboxyl terminal domain con- 

30 taining hydrophobic amino acids that lie in the lipid bilayer of the membrane, and there exert their biological and anti- 
genic effects. Antigens such as those of T-lymphocytes are particularly suited for gene cloning by the method of the 
present invention. However, cell surface antigens of any cells may be cloned according to the present method. More- 
over, proteins not normally expressed on the cell surface may admit of cloning according to the present method by, for 
example, using fluorescence activated cell sorting (FACS) to enrich for fixed cells expressing intracellular antigens. 

35 By "substantially pure" is meant any antign of the present invention, or any gene encoding any such antigen, whteh is 
essentially free of other antigens or genes, respectively, or of other contaminants with which it might normally be found 
in nature, and as such exists in a form not found in nature. By "functional derivative" is meant the "fragments", "variants", 
"analogs", or "chemical derivatives" of a molecule. A "fragment" of a molecule, such as any of the antigens of the 
present invention, is meant to refer to any polypeptide subset of the molecule. A "variant" of such molecules is meant 

40 to refer to a naturally occurring molecule substantially similar to either the entire molecule, or a fragment thereof. An 
'analog" of a molecule is meant to refer to a non-natural molecule substantially similar to either the entire molecule or 
a fragment thereof. 

A molecule is said to be "substantially similar" to another molecule if the sequence of amino acids in both molecules 
is substantially the same, and if both molecules possess a similar biological activity Thus, provided that two molecules 

46 possess a similar activity, they are considered variants as that term is used herein even if one of the molecules contains 
additional amino acid residues is not identical. As used herein, a molecule is said to be a "cherriical derivative" of 
another molecule when it contains additional chemical moieties not normally a part of the molecule. Such moieties 
may improve the molecule's solubility, absorption, biological half life, etc. The moieties may alternatively decrease the 
toxicity of the molecule, eliminate or attenuate any undesirable side effect of the molecule, etc. Moieties capable of 

50 mediating such effects are disclosed, for example, in Remington's Pharmaceutical Sciences, 1 6th ed. , Mack Publishing 
Co., Easton, penn. (1980). 

Similarly, a functional derivative" of a gene of any of the antigens of the present invention is meant to include "frag- 
ments", "variants", or "analogues" of the gene, which may be "substantially similar" in nucleotide sequence, and which 
encode a molecule possessing similar activity. 
55 The substantially pure antigens that hae been expressed by methods of the present invention may be used in immu- 
nodiagnostic assay methods well known to those of skill, including radio-immunodassays (RIAs), enzyme immu- 
noassays (ElAs) and enzyme-linked immunosorbent assays (ELISAs). The substantially pure proteins of the present 
inventbn, in soluble form, may be adminstered alone or in combination with other antigens, including lymphokines and 
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monokines or drugs, tor the treatment of imrnune-related diseases and disorders in animals, including humans. As 
examples of such disorders that may benefit from treatment with the substantially pure proteins of the present inventioin 
may be mentioned immune deficiency diseases, diseases of Immediate type hypersensitivity, asthma, hypersensitivity 
pneumonitis, immune-complex disease, vasculitis, systemic lupus erythematosus, rheumatoid arthritis, immunopath- 
5 ogenic renal injury, acute and chronic inflammation, hemolytic anemias, platelet disorders, plasma and other cell ne- 
oplasms, amyloidosis, parasitic diseases, multiple sclerosis, Guillaln-Barre syndrome, acute and subacute myopathic 
paralysis, myasthenia gravis, immune endocrinopathies. and tissue and organ transplant rejection, all as described in 
Petersdorf et al., eds., Harrison's Principles of Internal Medicine, supra. See also Weir, ed., supra; Boguslaski et al., 
eds.. supra; and Holborow et al.. eds., supra. 
10 When used for immunotherapy, the antigens of the present invention may be unlabeled or labeled with a therapeutic 

agent. Examples of therapeutic agents which can be coupled to the antigens of the invention for immunotherapy are 
drugs, radioisotopes, lectins, and toxins. 

The dose range for the administration of the antigens of the present invention are those large enough to produce 
the desired immunotherapeutic effects, such as unwanted cross-reactions, anaphylactic reactions, and the like. Gen- 
's erally. the dosage employed will vary with the age, condition, sex. and extent of the disease in the patient. Counterin- 
dications (if any), immune tolerance and other variables also will affect the proper dosage. Administration may be 
parenteral, by injection or by gradual perfusion over time. Administration also may be intravenous, intraparenteral, 
intramuscular, subcutaneous, or intradermal. 

Preparations for parenteral administration include sterile or aqueous or non-aqueous solutions, suspensions and 
20 emulsions. Examples of non-aqueous solvents include propylene glycol, polyethylene glycol, vegetable oils such as 
olive oil, and injectable organic esters such as ethyl oleate. Aqueous carriers include water, alcoholic and aqueous 
solutions, emulsions, or suspensions, including saline and buffered media. Parenteral vehicles include sodium chloride 
solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include 
fluid and nutrient replenishers, electrolyte replenishers, such as those based on Ringer's dextrose, and the like. Pre- 
2S servatlves and other additives also may be present, such as, for example, antimicrobials, antbxidants, chelating agents, 
Inert gases and the like. Such preparations, and the manner and method of making them, are known and descried, for 
example, in Remington's Pharmaceutical Science, 16th ed., supra. 

The antigens of the present Invention also may be prepared as medicaments or pharmaceutical compositions 
comprising the antigens, either alone or in combination with other antigens or other agents such as lymphokines, 
30 monokines, and drugs, the medicaments being used for therapy of animal. Including human, Immune-related indica-. 
tions. 

Although the antigens of the present invention may be administered alone, it is preferred that they be administered 
as a pharmaceutical composition. The compositions of the present invention comprise at least one antigen or its phar- 
maceutically acceptable salt, together with one or more acceptable carriers and optionally other therapeutic agents. 

3S By "acceptable" is meant that the agent or carrier be compatible with other ingredients of the composition and not 
injurious to the patient. Compositions include those suitable for oral, rectal, nasal, topical (including buccal and sub- 
lingual), vaginal, or parenteral administration. The compositions conveniently may be presented in unit dosage form, 
and may be prepared by methods well known in the pharmaceutical arts. Such methods include bringing into association 
the active ingredient with the carrier which constitutes one or more accesory ingredients. In general, compositions are 

40 prepared by uniformly and intimately bringing Into association the active ingredient with liquid carriers or finely divided 
solid carriers, or both, and shaping the product formed thereby, if required. 

Orally administered phamiaceutical compositions according to the present Invention may be in any convenient 
form, including capsules, cachets, or tablets, each containing a predetermined amount of the active ingredient. Powders 
or granules also are possible, as well as solution or suspension in aqueous or nonaqueous liquids, or oll-in-water liquid 

45 emulsions. The active ingredient also may be presented as a bolus, electuary or paste. 

Having now described the invention, the same will be more fully understood by reference to the following examples, 
which are not intended in any way to limit the scope of the invention. 

Example 

so 

Isolation and Molecular Cloning of the Human GDI 9 

The rapid Immunoselection cloning method of the present invention was applied to isolate and clone the GDI 9 
antigen. The nucleotide sequence of GDI 9 is shown in Figure 12. 

55 
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Claims for the following Contracting States : AT, BE, CH, DE, FR, GB, IT, LI, LU, NL, SE 

1 . cDN A comprising the nucleotide sequence shown in Fig. 1 2 coding for CD 1 9. 

2. An expression vector containing a cDNA according to claim 1 . 

3. A host cell containing an expression vector according to claim 2. 

4. A method for producing CD 1 9 antigen containing the steps: 

a) culturing a host cell according to claim 3; and 

b) isolating said antigen. 

Claim for the following Contracting State : ES 

1 . A method for producing CD 1 9 antigen encoded by a cDNA comprising the nucleotide sequence shown in Fig. 1 2 
coding for CD 1 9 containing the following steps: 

a) culturing a host cell containing an expression vector containing said cDNA; and 

b) isolating said antigen. 
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1 QKGTMTa (CTOTQCA AACAAAAAAA CCACCCaAC CAGCGGTGGT 
51 TTGTTTQCCG GATCAAGAfiC TACCAAQa TTTTCCGAAG GAAaGGCTT 
101 CAGCAGAGCG CAGATACCAA ATAaGTCQ laAGTGTAG CCGTAGTTAG 
151 GCCACCACTT CAAGAAGa GTAGCACCGC aACATACCT CGCTCTGCTA 
201 ATCaGTTAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTOTACCGG 
251 GnGGAQCA AGACGATAGT TACCGGATAA QQCGCAGCG6 TCGGGCTGAA 
301 CGGGGQGTTC GTGCACACAG CCCAGCTTQG AQCGAACGAC QACACCGAA 
351 CTGAGATACC TACAQCGTGA OaATGAGAA AGCGCCACGC HCCCGAAGG 
401 GAGAAAQGCG GACAGGTATC CGGTAAGCGG CAGQGTCGGA ACAQGAGAGC 
451 GCACGAGGGA GOTCCAGGG GGAAACGCa GGTATCITTA TAGTCCTGTC 
501 GGGTrrCGCC ACaCTGACT TGAGCGTCGA TTTTTGTGAT GaCGTCAGG 
551 GGQQCQGAGC QATGGAAAA ACGCCAGCAA CQCCGAATTa CCQCGGTCTT 
601 TQCAACGTA ACACTTTACA QCQQCQCGTC ATTTGATATG ATGCGCCCCG 
651 OTCCCGATA AQGGAGCAGG CCAGTAAAAG CATTACCCGT GGTGGGGTTC 
701 CCGAGCGGCC AAAGGGAGCA GAQCTAAAT CTGCCGTCAT CGACHCGAA 
751 GGHCGAATC CTTCCCCCAC CACCATCACT TTCAAAAGTC CGAAAGAAtC 
801 TCaCCaGC nGIGTCrrG GAOGTCCaG AGTAGTGCGC GAGTAAAA7T 
851 TAAGCTACAA CAAGGCAAGG CTTGACCGAC AAHGCATGA AGAATaOCT 
901 TAGGGTTAGG CGnTTQCQC TQC7TCGCGA TGTACQGQCC AGATATACGC 
951 GTTGACAnG ATTATTGACT AGHATTAAT AGTAATCAAT TACGGGGTCA 
1001 TTAGnCATA GCCCATATAT GGAGTTCCQC GHACATAAC TTACGGTAAA 
1051 TGGCCCGCa GGCTGACCGC CCAACGACCC CCGCCCA7TG ACGTCAATAA 
1101 TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 
1151 TGGGTGGACr ATTTACGGTA AAQQCCCAC TTGQCAGTAC ATCAAGTCTA 
1201 TCATATQCCA AGTACGCCCC CTATTGACGT CAATGACQGT AAATGQCCCG 
1251 CaGGCATTA TGCCCAGTAC ATGACCTTAT GGGACmCC TACTTGGCAG 
1301 TACATCTACG TATTAGTCAT CQCTATTACC ATGGTGATGC GGnTTGGCA 
1351 GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA ITTCCAAGTC 
1401 TCCACCCCAT TGACGTCAAT GGGAGTriGT nTGGCACCA AAATCAACGG 
1451 GACTTTCCAA AATGTCGTAA CAAaCCGCC CCATTGACGC AAATGGQCGG 
1501 AAFCQGQG CQGGAaOGG GAGTQGCGAG CCQCAGATG CTGCATATAA 
1551 QCAOaGCTT TTTQCCTGTA CTGGGiaCT CIGGHAGAC CAGATCTGAG 
1601 CCTGGGAGCT QCTGCaAA CTAGAGAACC CAaGCTTAA GCCTCAATAA 
1651 AGCnCTAGA GATCCCTCGA CaCGAGGGA TGTCCATAC CTACCAGTTC 

FIG. 1 
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1701 TQCQCCTGCA GGTCQCQQCC QCGAQCTAG AGGATCTITG TGAAGGAACC 

1751 TTACTTCrGT QGTGTGACAT AAnOGACAA AQACCTACA GAGATTTAAA 

1801 GaCTAAGGT AAATATAAAA TmTAAGTG TATAATGTGT TAAAQAQG 

1851 AHQAAHG nTGTGTAn TTAGAnCCA ACCTATGGAA aCATGAATG 

1901 GGAGCAGTGG TOGAATGCQ HAATGAGGA AAACaGTTr TOaCAGAAG 

1951 AAATQCCATC TAGTGATGAT GAQGCTAaO QGAQaCA ACATTQACT 

2001 CCTCCAAAAA AGAAGAGAAA QGTAGAAGAC CCCAAQGACT rrCCTTCAGA 

2051 ATTOaAAGT TTTTTGAGTC ATGaCTGTT TAGTAATAGA AaOTGCTT 

2101 QCmOCTAT TTACACCACA AAGGAAAAAG aGCACTGCT ATACAAGAAA 

2151 ATTATGGAAA AATATTaCT AACCnTATA AGTAGQCATA ACAGTTATAA 

2201 TCATAACATA aGTmTTC HAaCCACA CAGGCATAGA GTGTCTGaA 

2251 TTAATAACTA TGCTCAAAAA nGTGTACCT TTAGCTTTTT AATHGTAAA 

2301 QOGGHAATA AGGAATATTT GATGTATAGT GCOTGACTA GAGATCATAA 

2351 TCAGCaTAC CACATTTGTA GAGGTTTTAC nGCTTTAAA AAACCTCCCA 

2401 CACaCCCCC TGAACaGAA ACATAAAATG AATGCAATTG TTGTTGTTAA 

2451 CTTGTTTAn GCAGCTTATA ATOGHACAA ATAAAQCAAT AQCATCACAA 

2501 ATTTCACAAA TAAAQCATTT TTnCACTGC ATTCTAGTTC TQGTTTGTCC 

2551 AAAQCATCA ATGTATCTTA TCATGTCTQG ATCQGIGGA ATGTGTGTCA 

2601 GTTAGQGrGT GGAAAGTCCC CAGQCTCCCC AGCAGGCAGA AGTATGCAAA 

2651 GCATGCATCT CAATTAGTCA GCAACCAGGT GTQGAAAGTC CCCAGGCTCC 

2701 CCAGCAGGCA GAAGTATGCA AAGCATGCAT QCAATTAGT CAGCAACCAT 

2751 AGTCCCGCCC CTAAaCCGC CCATCCCGCC CaAAQCCG CCCAGHCCG 

2801 CCaTTCTCC GCCCCATGGC TGAaAATTT TiTnATTTA TOCAGAQQCC 

2851 GAGGCCQCa CGQCaaCA QCTAnCCAG AAGTAGTGAG GAGGCnrrr 

2901 TOGAGGCQA QGCTTTTGCA AAAAGCTaat TC 

FIG. 1 
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1 GGCGTAATCT GCTGCHGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT 

51 rrGTITGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT 

101 TCAGCAGAGC GCAGATACCA AATACTGTCC HCTAGIGTA GCCGTAGTTA 

151 GQCCACCACT TCAAGAAQC TGTAGCACCG CCTACATACC TCQCTaoa 

201 AATCaGHA CCAGTQQCTG aOCaGTGG CGATAAGTCG TGTCTTACCG 

251 GGnGGAQC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA 

301 ACQGGGQGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CaACACCGA 

351 AaCAGATAC CTACAGCGTG AGCATTGAGA AAGCGCaCG CHCCCGAAG 

401 GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG 

451 CGCACGAGGG AGCHCCAGG GGGAAACGCC TCGTATCTTT ATAGTCCTGT 

501 CGGGITTCGC CACCTCTGAC nCAGCGTCG AmTTGTGA TGCTCGTCAG 

551 GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCAAGCTA GCHCTAGCT 

601 AGAAATTGTA AACGHAATA mTGnAAA ATTCGCGTTA AATTTTTGTT 

651 AAAiCAGCTC AnTTTTAAC CAATAGGCCG AAATCQGCAA AATCCCTTAT 

701 AAATCAAAAG AATAGCCCGA GATAGGGTTG AGTGnGTTC CAGTTTGGAA 

751 CAAGAGTCCA CTAHAAAGA ACGTGGACTC CAACGTCAAA GGQCGAAAAA 

801 CCGTCTATCA GGGCGATGGC CGCCCAaAC GTGAACCATC ACCCAAATCA 

851 AGTmTTGG GGTCGAGGTG CCGTAAAGa CTAAATCGGA ACCCTAAAGG 

901 GAGCCCCCGA TTTAGAGCTT GACGGGGAAA GCCGGCGAAC GTGGCGAGAA 

951 AGGAAGGGAA GAAAGCGAAA GGAGCGGGCG CTAGGGCGCT GGCAAGTGTA 

1001 GCGGTCACGC TGCGCGTAAC CACCACACCC GCCGCGCTTA ATGCGCCGCT 

1051 ACAGGGCGCG TACTATQGTT GCTITGACGA QCACGTATAA CGTGCmCC 

FIG. » . 
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1101 TCGTTQ(yV^T CA(yUX:GQQA QCTAAACAG6 AGQCCGAm AA(XX^ 

1151 AGACAQGWC QGTACQCCAG aCGATCACC GCGGTOTTC TCAACGTAAC 

1201 ACnTACAQC GGCGCGTCAT HGATATGAT GCGCCCCCa TCCCGATAAG 

1251 QGAQCAGQCC AGTAAAAGCA TTACCCGTGG TQGGGTTCCC GAGCQGCCAA 

1301 AGGGAGCAGA CTaAAATCT GCCGTCATCG ACHCGAAGG TTCGAATCCT 

1351 TCCCCCACCA CCATCACTTT CAAAAGTCCG AAAGAAKTG aCCaoCTT 

1401 GTCTGTTGGA GGTCOaGAG TAGTGCaGA GTAAAATTTA AQCTACAACA 

1451 AGGCAAQQCT TGACCGACAA HGCATGAAG AATCTQCTTA GOGnAQGCG 

1501 TTTTQCGCTG CTTCGCGATG TACGGGCCAfi ATATACGCGT TGACAHGAT 

1551 TAHGAaAG TTATTAATAG TAATCAAHA CGQQGTCATT AGTTCATAQC 

1601 CCATATATGG ACnCCQCGT TACATAACTT ACGGTAAATG GCCCGCCTGG 

1651 CTGACCGCCC AACGACCCCC GCCCAnCAC GTCAATAATG ACGTATGTTC 

1701 CCATAGTAAC GCCAATAGGG ACTTTCCAn GACGTCAATG GGTGGACTAT 

1751 TTACGGTAAA aGCCCAOT GGCAGTACAT CAAGTGTATC ATATQCCAAG 

1801 TACQCCCCa AHGACCTCA ATGACQGTAA ATGQCCCQCC TQQCATTATG 

1851 CCCAGTACAT GACOTATGG GACTTTCCTA CTTGGCAGTA CATaACGTA 

1901 TTAGTCATCG CTAHACCAT GGTGATQCQG TnTOXAGT ACATCAATQG 

1951 QCGTQGATAG CGGTITGCr CACGGGGAH TCCAAGTCTC CACCCCATTG 

2001 ACGTCAATQG GAGTTTGTTT TGQCACCAAA ATCAACGQGA CTITCCAAAA 

2051 TGTCGTAACA AQCCGCCCC A7TGACGCAA ATGGGCGGAA HCaGGGCG 

2101 GGAQGGGGA GTGGCGAGCC CTCAGATCa GCATATAAGC AGaGCmT 

2151 TGCCTGTACT GGGTCTaCT GGTTAGACCA GATaGAQCC TGGGAGCTCT 

2201 aGGCTAACT AGAGAACCCA aGCTTAAGC CTCAATAAAG CTTCTAGAGA 

2251 TCCaCGACC TCGAGATCCA HCTGaGQC GCGGATrOT TATCAQGAT 

FIG. ^ (con't) 
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■ 2301 AAGnCGTGG ACATAHATG TTTATCAGTG ATAAAGTGTC AAGCATGACA 

2351 AACnGCAGC CGAATACAGT GATCCGTGCC GCCCTAGACC TGHGAACGA 

2401 GGTCGGCGTA GACGGTCTGA CGACACGCAA AaOGCGGAA CGGnCGGGG 

2451 nCAGCAGCC GGCGCTTTAC TGGCAOTCA GGAACAAGCG GGCGaCCTC 

2501 GACGCACTGG CCGAAGCCAT GaCGCGGAG AATCATAGCA GTCGGTGCC 

2551 GAGAGCCGAC GACGACTGGC GaCATTTCT GACTGGGAAT GCCCGCAGCT 

2601 TCAGGCAGGC GCTGCTCGCC TACCGCCAGC ACAATGGATC TCGAGGGATC 

2651 nCCATACCT ACCAGTTaC CGCCTGCAGG TCGCGGCCGC GACTCTAGAG 

2701 GATCnTGTG AAGGAACCH ACTTCTGTGG TGTGACATAA HGGACAAAC 

2751 TACCTACAGA GATTTAAAGC TCTAAGGTAA ATATAAAAH IITAAGTGTA 

2801 TAATCTGHA AACTACTGAT TCTAATTGIT TGIGTATTTT AGAHCCAAC 

2851 CTATGGAACT GATGAATGGG AGCAGTGGTG GAATGCCTTT AATGAGGAAA 

2901 ACCTGTTTrG CTCAGAAGAA ATGCaTCTA GTGATGATGA GGCTACTQCT 

2951 GACTCTCAAC AnCTACTCC TCCAAAAAAG AAGAGAAAGG TAGAAGACCC 

3001 CAAGGACTTT CCHCAGAAT TGCTAAGTTT TTTGAGTCAT GCTGTGTTTA 

3051 GTAATAGAAC TCnGCTTGC mGCTATTT ACACCACAAA GGAAAAAGCT 

3101 GCACTGCTAT ACAAGAAAAT TATGGAAAAA TATTCTGTAA CC7TTATAAG 

3151 TAGGCATAAC AGHATAATC ATAACATACT GrnTfTCTT ACTCCACACA 

3201 GGCATAGAGT GTCTGaAH AATAACTATG CTCAAAAATT GTGTACCTTT 

3251 AGCmTTAA TrTGTAAAGG GGTTAATAAG GAATATTTGA TGTATAGTGC 

3301 CTTGACTAGA GATCATAATC AGCCATACCA CAnTGTAGA GGTTTTACTr 

3351 GCTTTAAAAA ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA 

3401 TGCAAHGTr GHGnAACT IGTHAHGC AGCTTATAAT GGTTACAAAT 

3451 AAAGCAATAG CATCACAAAT HCACAAATA AAGCATTm HCACTGCAT 

FIG. ® ^''^"'^^ 



20 



7/11/2007, EAST Version: 2.0.3.0 



EP 0 739 980 A2 



3501 TCTAGTTGTG GTTTGTCCAA AQCATCMT GTATCTTATC ATGiaCGAT 

3551 CaGTQGAAT GTGT6TCAGT TAGGGTGTQG AAAGTCCCa QOaCCCCAG 

3601 CAGGCAGAAG TATQCAAAGC ATGCATCTCA AHAGTCAGC AACCAGGTGT 

3651 QGAAAGTCCC CAGGaCCCC AGCAGGCAGA AGTATGCAAA GCATGCATa 

3701 CAAHAGTCA GCAACCATAG TCCCGCCCCT AACTCCGCCC ATCCCGCCCC 

3751 TAACTCCGCC CAGHCCGCC CATTCTCCGC CCCATGGCTG ACTAATmT 

3801 nTATTTATG CAGAGGCCGA GGCCGCCTCG GCCTCTGAGC TAHCCAGAA 

3851 GTAGTGAGGA GGCirmTG GAGGCCTAGG CTTTTGCAAA AAGQAAHC 

FIG- ® 
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1 . .GGAGAGTC TGACCACCAT GCCACCTCCT CGCCTCCTCT TCHCCTCCT 

51 CnCCTCACC CCCATGGAAG TCAGGCCCGA GGAACCTCTA GTGGTGAAGG 

101 TGGMGAGGG AGATAACGCT GTGCTGCAGT GCCTCAAGGG GACCTCAGAT 

151 GGCCCCAaC AQCACaGAC CTGGTCTCGG GAGTCCCCGC TTAAACCCTT 

201 OTAAAACTC AGCCTGGGGC TGCCAQGCa GGGAATCCAC ATGAGGCCCC 

251 TGGcCATCTG GCTTnCATC TTCAACGTCT CTCAACAGAT GOGGGGGTC 

301 TACCTGTGCC AGCCGGGGCC CCCCTCTGAG AAGGCCTGGC AGCCTGGCTG 

351 GACAGTCAAT GTGC^GGGCA GCGGGGAGCT GnCCGGTGG AATGTTTCGG 

401 ACCTAGGTGG CCTGGGCTGT GGCCTGAAGA ACAGGTCCTC AGAGGGCCCC 

451 AGCTCCCCn CCGGGAAGCT CATGAGCCCC AAGCTGTATG TGTGGGCCAA 

501 AGACCGCCCT GAGATCTGGG AGGGAGAGCC TCCGTGTGTC CCACCGAGGG 

551 ACAGCCTGAA CCAGAGCCTC AGCCAGGACC TCACCATGGC CCCTGGCTCC 

601 ACACTCTGGC TGTCCTGTGG GGTACCCCCT GACTCTGTGT CCAGGGGCCC 

651 CCTCTCCTGG ACCCATGTGC ACCCCAAGGG GCCTAAGTCA TTGCTGAGCC 

701 TAGAGCTGAA GGACGATCGC CCGGCCAGAG ATATGTGGGT AATGGAGACG 

751 GGTCTGHGT TGCCCCGGGC CACAGCTCAA GACGCTGGAA AGTAHAHG 

801 TCACCGTGGC AACCTGACCA TGTCATTCCA CCTGG-AGATC ACTGCTCGGC 

851 CAGTACTATG GCACTGGCTG CTGAGGACTG GTGGCTGGAA GGTCTCAGCT 

901 GTGACTTTGG CTTATaGAT CTTCTGCaG TGHCCCTTG TGGGC^^CT 

951 TCATCnCAA AGAGCCCTGG TCCTGAGG\G GAAAAGAAAG Ca\ATGACTG 

1001 ACCCCACCAG GAGATTCTTC AAAGTGACGC CTCCCCCAGG AAGCGGGCCC 

1051 CAGAACCAGT ACGGGAACGT GCTGTCTCTC CCCACACCCA CCTCAGGCCT 

1101 CGGACGCGCC CAGCGTTGGG CCGCAGGCCT GGGGGGCACT GCCCCGTCTT 

1151 ATGGAAACCC.GAGCAGCGAC GTCCAGGCGG ATGG^GCCTT GGGGTCCCGG 
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CACACQGAC 
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AGCAGATGAT 
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GGAGCAATGT 


TGCTTAGGAT 
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GTGTGCATGT 


GTGTAAGTGT 


GTGTGTGTGT 


GTGTGTGTGT 


GTGTGTGTGT 
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